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Abstract — The main focus of space-time coding design and 
analysis for MIMO systems has been so far focused on single- 
user systems. For single-user systems, transmit diversity schemes 
suffer a loss in spectral efficiency if the receiver is equipped with 
more than one antenna, making them unsuitable for high rate 
transmission. One such transmit diversity scheme is the cyclic 
delay diversity code (CDD). The advantage of CDD over other 
diversity schemes such as orthogonal space-time block codes 
(OSTBC) is that a code rate of one and delay optimality are 
achieved independent of the number of transmit antennas. In 
this work we analyze the ergodic rate of a multi-user multiple 
access channel (MAC) with each user applying such a cyclic delay 
diversity (CDD) code. We derive closed form expressions for the 
ergodic sum-rate of multi-user CDD and compare it with the 
sum-capacity. We study the ergodic rate region and show that in 
contrast to what is conventionally known regarding the single- 
user case, transmit diversity schemes are viable candidates for 
high rate transmission in multi-user systems. Finally, our theo- 
retical findings are illustrated by numerical simulation results. 



I. Introduction 

Multiple-input multiple-output or MIMO wireless systems 
have received a significant amount of interest due to their 
capability of dramatically increasing the capacity of a com- 
munication link [2], [3]. Also, there has been considerable 
work on a variety of schemes which exploit multiple antennas 
at both the transmitter and receiver in order to obtain spatial 
diversity, i.e. to improve the reliability of the system such 
as orthogonal space-time block codes (OSTBC) [4], [5] and 
space-time ttellis codes [6] (STTC). An simple example for 
a STTC is the delay diversity code, first proposed in [7] 
and also discussed in [6], [8], where the data stream on the 
first transmit antenna is transmitted with delays on the other 
antennas. The delay diversity code achieves full diversity but 
has a disadvantage of a slight rate loss due to some leading 
and tailing zeros. In order to avoid this drawback, cyclic delay 
diversity (CDD) has been proposed in [9]. 

The main advantage of CDD over OSTBC is that a code 
rate of one is achieved independent of the number of transmit 
antennas, whereas OSTBC suffer a rate loss by increasing the 
number of transmit antennas [10]— [12]. The performance of 
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CDD in terms of error probability was analyzed in [9] for 
frequency flat and frequency selective channels. The diversity- 
multiplexing tradeoff for delay diversity was characterized 
in [13]. Based on the assumption that statistical information 
about the channel is available at the transmitter, transmit filters 
employed at the transmitter are optimized in [14] in order to 
minimize the Chernoff bound on the error probability of CDD. 
The average rate for CDD by assuming Gaussian as well as 
PSK/QAM input signals for a point-to-point transmission with 
tit = 2 or Ut = 4 transmit antennas and tir < 2 receive 
antennas was investigated in terms of Monte-Carlo simulations 
in [15]. 

As noticed in [16], most of the work on space-time coding 
so far is focused on single-user systems. For such single- 
user systems, it was shown that the loss in terms of spectral 
efficiency for transmit diversity schemes increases significantly 
for more than one receive antenna [17], making them inappro- 
priate for high rate transmission. In this work, by assuming a 
Gaussian codebook, we analyze the ergodic rate performance 
of a multiple access channel (MAC) system with multiple users 
transmitting their data to the base station by using a CDD. 
At the receiver, we apply a joint maximum-likelihood (ML) 
detector. We compare the achieved ergodic sum-rate with the 
sum-capacity of a MAC system assuming that the users do not 
have channel state information (CSI) and the base station has 
full CSI. We show that, if the number of receive antennas ur 
is less than the number of users K, i.e. ur < K in the system 
(which is a reasonable assumption), the loss of the proposed 
scheme in terms of spectral efficiency is negligible. This is in 
strong conttast to single-user systems [15], [17]. Thus, CDD is 
indeed an attractive candidate for transmit strategies in MIMO 
multi-user systems. 

The remainder of the paper is organized as follows. In 
Section|n] we introduce the system model and the construction 
of the CDD. The performance of CDD in terms of spectral 
efficiency is then analyzed and compared to the sum-capacity 
in sectionQlIl followed by simulations and concluding remarks 
in Section HVl and N\ 

Notational conventions are as follows. We will use bold- 
faced upper case letters to denote matrices, e.g., X, with 
elements Xij\ bold-faced lower case letters for column vec- 
tor, e.g., x, and light-faced letters for scalar quantities. The 
superscripts (-) T and (-) H denote the transpose and Hermitian 




Fig. 1. Multiuser MIMO multiple access channel (MAC) with users 
MS 1 !,. . . , MSk transmitting data to base station (BS). 



operations, respectively. Finally, the identity matrices and all 
zero vectors of the required dimensions will be denoted by I 
and T . E[ ] will denote the expectation operator. 

II. System model 

In this work, we consider a multiuser multiple access 
channel (MAC) with K users each with tit transmit antennas 
and tir receive antennas at the base station as depicted in 
Fig.CD 

The system model is given as 
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SNR 
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N, 



(1) 



where SNR denotes the signal-to-noise ratio, is the trans- 
mit matrix of size [T x tit], T is the block length and N 
([Txtir]) is the additive white Gaussian noise (AWGN) matrix 
N ~ CAf(0, 1). In this work, we focus on delay optimal codes, 
in the sense that the decoding is performed after T channel 
uses with Ut = T [5], [10]. The channel between user k, 
1 < k < K, and the base station is modeled by a random 
zero-mean component channel ([?i_rX71t]) with identically 
independent distributed (iid) complex Gaussian entries, i.e. 
Hfc ~ CAf(0, 1), 1 < k < K. We assume that each user k 
is applying a cyclic delay diversity scheme (for simplicity we 
skip the user index) , given as 



G fc (x fc ) = G(x) 



where x k = [x 1 ,x 2 , 
symbols of user k. 
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,Xt] t is the vector of transmitted 



III. Performance analysis 

In the following, we analyze the ergodic sum-rate perfor- 
mance achieved with the proposed scheme and compare it 
to the ergodic MAC sum-capacity. We start with the case, 



where only one antenna is available at the base station, i.e. 
titi = 1, and afterwards generalize it to higher number 
of receive antennas. The separation of these two cases is 
motivated by the fact that the spectral efficiency for transmit 
diversity schemes in single-user systems with one receiving 
antenna is close to (e.g. QSTBC) or even equal to capacity 
(Alamouti scheme) [17]. With more than one receive antenna, 
this behavior changes significantly to the disadvantage of the 
transmit diversity schemes. As we will see later on, for multi- 
user systems the picture looks different. 

A. tir = 1 antenna at the base station 

For riR = 1 at the base station, the system equation in (Q]) 
reduces to 
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with hfc of size [1 x ut), which can be rewritten as 
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with where the circulant matrices Hfe, 1 < k < K, 
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are the effective channels between user k and the base station, 
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,x£F is 



obtained by stacking the transmit signals of the users into 
one large vector. The instantaneous rate achievable with the 
circulant code is then given as 

1 , / SNR -„- 

Ir. = — log 2 dot I T + 
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Since the H^, 1 < k < K, are all circulant matrices, they 
can be simultaneously diagonalized by the unitary DFT matrix 
D [18], i.e. H fc Hf = Dn T A fc D ff . Therefore, I c results in 
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where the A t are i.i.d. chi-square distributed random variables 
with 2K degrees of freedom, i.e. X t ~ X2K- F° r ^ e er g°dic 
sum-rate performance given as R c = E[J C ], we thus have 



J>g 2 (1 + SNRA t ) 



E[log 2 (l + SNRA)] 



where A ~ xiif ■ I n order to derive a lower bound on the above 
expression, we make use of the fact that log 2 (1 + ae x ) is a 
convex function with a > 0, thus applying Jensen's inequality 
results in 

R c > log 2 (1 + SNRcxp (E [log 2 (A)])) . 
From [19], [20] we know that 
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(3) 



where ?p(-) is the digamma or psi function [21, p. 943, 
eq.8.360] and 7 f=a 0.577 is Euler's constant. Thus, for tir = 1 
receive antenna at the base station, the average sum-rate 
performance of the proposed scheme is lower bounded by 



B. Arbitrary number of receive antennas Hr 

For higher number of receive antennas the effective (some- 
times also referred to as equivalent) channel to each user k is 
given as 



H, 



where 
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Rc > log 2 1 + SNRexp 




(4) 



In contrast, note that the sum-capacity of the multiuser MAC 
system can be lower bounded by 



C > log 2 1 
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The result in (0 is obtained by applying the techniques in [19] 
and the fact that for no CSIT the ergodic sum capacity of 
a K users MAC channel, where each user has tit transmit 
antennas, is equivalent to the ergodic capacity of a single- 
user system with Kut transmit antennas [22, Proposition 1], 
[2]. By comparing and (O, we observe that the difference 
between the ergodic sum-rate achieved with CDD and the sum- 
capacity, i.e. C — R c converges to 

^ n T K-l ^ 

_ £ --1„(„ T ) 

for high SNR. This result is restricted to high SNR, where the 
lower bounds in (0]i and ^ are tight and thus can be used for 
computing the difference C — R c . Note that 2fc=fc+i \ 
nondecreasing function in K and tit- Further, note that 

utK — 1 utK — 1 K ^ 

lim - = lim T ~ ^ 7 
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= lim ij}(n T K ) - ij)(K + 1) = ln(n T ). 



(7) 



Thus, for rin ; = 1 the difference in (O, i.e. C — R c , is upper 
bounded by 



C-R c < 



K\n(2) 



(8) 



for high SNR, i.e. the more the number of users the less is the 
difference between the actual capacity and the rate achieved 
with CDD. 



with 
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being a [n^nT x utK] matrix. Thus, H is a block matrix 
with circulant blocks. Similarly to the single receive antenna 
case, the instantaneous rate with arbitrary Ut, ur and K users 
achievable with the circulant code is then given as 

h = ^ log 2 det (ln RnT + — HH fl 
We are now able to state the following theorem. 



(10) 



Theorem 1 The ergodic-sum rate of a multi-user MAC 
system with K users each equipped with tit transmit antennas 
applying a CDD and tir receive antennas at the base station 
is lower bounded by 



L / /M-l 

Rc > log 2 1 + SNRcxp ]T - 

1=1 V \fc=i 

with L = min(nij, K) and M = max(nij, K). 



(11) 



Proof: : The proof is given in the Appendix. 
In comparison, using similar techniques the sum-capacity 
of the multiuser MAC system with K users, each with tit 
transmit and n# receive antennas can be lower bounded by 
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with L = mm(nn, tltK), M = max(n#, utK). 

By applying Jensens inequality, the lower bound on R c 
in ( ITTb and the lower bound for C in ( fT2l may be bounded 
from below by 

R c >L\og 2 ^l + SNRcx P ^i^^ i- 7 ^ (13) 



and 
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respectively. These new lower bounds get tight for high 
SNR [19]. Assume that tir < K, which is a reasonable 
assumption. Then, applying similar steps as in ©- @, us- 
ing ( TT~3T > and < TT~4T > the difference C—R c may be upper bounded 
by 
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for high SNR. Thus, increasing K reduces the difference C — 
R c , while increasing ur increases the difference C — R c . 

In addition to the lower bound, an upper bound is also 
derived following the approach of [20]. Applying Jensen's 
inequality to ( f2TT > results in 

R c < log 2 E [dct (I + SNRH'jH'^)] 

due to the concavity of the logarithm. With [20, Theorem A.4, 
eq.(A.ll)], we arrive at 
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In the following section, our theoretical results are illustrated 
by numerical simulations. 

IV. Simulations 

A. Single-User case 

In Fig. [2] the ergodic rate R c achievable with the CDD 
and the ergodic capacity for a single-user system are depicted 
with nx = 4 transmit and ur = 1 and ur = 2 receive 
antennas, respectively. In addition to this, the lower bound on 
R c in (fTTl i derived in the previous section is also depicted. 
From the figure, we observe that for ur = 1, R c is close 
to the capacity. By increasing tir, we observe that the loss 
increases significantly. The lower bound on R c confirms this 
behavior and verifies the numerical results obtained in [15], 
[17] that transmit diversity schemes perform poorly for hr > 
1 in single-user systems. 

B. Multi-User case 

In Fig. [3] the ergodic capacity region of a symmetric 
system with K = 2 users each equipped with nr = 2 
transmit antennas and a base station with ur ~ 2 receive 
antennas is depicted for three different SNR values ranging 
from SNR = dB to SNR = 40 dB. In addition to that, 
the ergodic rate region achievable with CDD is depicted. The 
corner points of the regions can be achieved by a successive 
cancelation decoder. Note that the capacity and the maximum 
rate achievable with CDD of the point-to point link with the 
other user absent from the system (or equivalently completely 
canceled) are given by 
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Fig. 2. Single-user ergodic rate of th e CD D and capacity, ur 
tit = 4. Lower bounds obtained from II U are also depicted. 
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respectively, where i ([tir x 1]) is the ith column of the 
hr x K matrix in ( 1211 . The rate of the individual user 
cannot exceed these bounds, which are referred to as single- 
user bounds with R l c + R 2 C = C and C\ + C2 — C From the 
figure, we observe that the gap between the single-user bounds, 
e.g. Ci — R\, is significantly large. An interesting observation 
is, however, that the corner points of the capacity region are to 
the left or below the corner points of the ergodic rate region 
of CDD. Thus, at the so called Pareto optimal part [24] of 
the rate region which is the time-sharing part between the two 
corner points, the gap between the symmetric CDD rate region 
to the capacity region is dramatically reduced. The Pareto 
optimal part contains all the optimal operating points of the 
channel [24, Ch.6,p.231]. 

In Fig. 21 we depict for a system with K = 6 users and 
n r = 71 r = 3 antennas from top to bottom (at the most 
right): the ergodic capacity, the lower bound on the ergodic 
capacity in (fT2l (very tight), the upper bound on the ergodic 
sum-rate of the CDD R c in (1161 . R c , and the lower bound 
on R c in (fTTl i discussed in the previous sections. Differently 
from the single-user system, most of the ergodic capacity is 
obtained by the CDD despite multiple receiving antennas at 
the base station. Both the lower and upper bound on R c track 
the ergodic sum-rate performance of the CDD quite well. 

V. Conclusion 

Research in the area of space-time design and analysis 
for MIMO systems has traditionally focused on single-users 
scenarios. For these systems, transmit diversity schemes suffer 
a loss in spectral efficiency when the receiver has more than 
one antenna, making them unsuitable for high rate transmis- 
sion. Here, we focused on the performance of cyclic delay 
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Fig. 3. Ergodic Capacity Region (dashes lines) and ergodic rate region of the 
CDD (solid lines) with K = 2 users, n R = 2 and n T = 2 for SNR = OdB, 
SNR = 20dB and SNR = 40dB. 
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Fig. 4. Ergodic sum-rate of the CDD and sum-capacity with K = 6 
user s, nn = 3 and tit = 3. Upper and lower bounds obtained from U61 
and < 1 2H 1 It . respectively, are also depicted. 



diversity (CDD) codes. We analyzed the ergodic sum-rate and 
studied the ergodic rate region achievable by using the CDD 
for each user in a multi-user multiple access channel (MAC). 
We derived tight closed form expressions for the ergodic 
sum-rate of multi-user CDD and compared it with the sum- 
capacity. We showed that in contrast to the single-user case, 
transmit diversity schemes are viable candidates for high rate 
transmission in multi-user systems when the number of users 
exceeds the number of antennas at the base. 



Appendix 

A. Proof of Theorem Q] 

Proof: The expression in ( fTQb may be rewritten as 

I c =% log 2 det f I + — (I nR ® D) HH ff (I„ R ® D) ff 



n T HH [ 
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where D is the unitary DFT-matrix of appropriate size of [tit x 
i%t] with DD ff = I and ® denotes the Kronecker product 
between two matrices. According to [18, p. 183], the matrix 
HH fl is a [rtftjiT x tirjit] block matrix with diagonal blocks 
of size [tit x tit]- The non-zero entries of the matrix H are 
each CAf(0, 1) distributed. 

Let us now multiply the matrix fLH H in (fT9b from right 
and left with a [n^riT X njinx] permutation matrix P and 
P H , respectively. It holds that PP H = I. 

Example .1 For tit = 4, tir = 2 and K = 2, P is given as 
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Thus, we have 
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= f E l0 §2 dct (!n R + SNRH^Hf ) , (20) 

i=l 

where H' = diag (H' 1; H 2 , . . . , ) is a block diagonal 
matrix, with tit blocks of size [tir x K] and each block is 
CAf (0,1) distributed. Eq. (f20b can be interpreted as the sum- 
rate of tit parallel MIMO channels each with n r receive and 
K transmit antennas, divided by the block length of T. Taking 
the expectation of ( 1201 ) with T = rvp results in 



R c = Elog 2 det (I + SNRH^H']" 



(21) 



Following the same approach as of [19], and using the lower 
bound on mutual information from [3, eq.(12)] that 

min(n/i,AT) 

I> ]T log 2 (l + SNRX J ), 

3=1 

where the X j - xl {maAnR . K )- ]+ iy we a PP!y Jensen's 
inequality with 

E [log 2 (1 + aexp(x))} > log 2 (1 + aexp(E [as])) 

for a > to (f2Tb . Furthermore, by using ^ we arrive at the 
lower bound for R c given in ( fTTT i. ■ 
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